3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
I don't know
Size:
12976 entries Production Status:
Existing-used
Use:
Automated Essay Scoring
-
Paper title:Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring
-
Paper track:Short/NLP Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Haoran Zhang | Automated Student Assessment Prize | /N |
Documentation:
Yes, https://www.kaggle.com/c/asap-aes/data
Written
Corpus,
Language Type:
Bilingual
Languages:
English Sinhala
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arya D. McCarthy | flores | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English French
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arya D. McCarthy | machine translation for noisy text | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arya D. McCarthy | wmt14 data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English Romanian
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arya D. McCarthy | wmt16 data | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International
Size:
3285 entries Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
-
Paper title:Analysing Lexical Semantic Change with Contextualised Word Representations
-
Paper track:Long/Semantics: Lexical
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mario Giulianelli | DUPS: Diachronic Usage Pair Similarity | /N |
Documentation:
The documentation for DUPS is available at https://doi.org/10.5281/zenodo.3773250.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Analyzing Political Parody in Social Media
-
Paper track:Long/Computational Social Science and Social Media
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nikolaos Aletras | Twitter Parody Data | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1 MByte Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:The Sensitivity of Language Models and Humans to Winograd Schema Perturbations
-
Paper track:Long/Theme
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mostafa Abdou | Enhanced Winograd Schema Challenge | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Afrikaans Albanian Amharic Arabic Aragonese Armenian Assamese Azerbaijani Basque Belarusian Bengali Bosnian Breton Bulgarian Burmese Catalan Central Khmer Chinese Croatian Czech Danish Dutch Dzongkha English Esperanto Estonian Finnish French Gaelic Galician Georgian German Greek Gujarati Hausa Hebrew Hindi Hungarian Icelandic Igbo Indonesian Irish Italian Japanese Kannada Kazakh Kinyarwanda Korean Kurdish Kyrgyz Latvian Limburgan Lithuanian Macedonian Malagasy Malay Malayalam Maltese Marathi Mongolian Nepali Northern Sami Norwegian Norwegian Bokmål Norwegian Nynorsk Occitan Oriya Panjabi Pashto Persian Polish Portuguese Romanian Russian Serbian Serbo-Croatian Sinhala Slovak Slovenian Spanish Swedish Tajik Tamil Tatar Telugu Thai Turkish Turkmen Uighur Ukrainian Urdu Uzbek Vietnamese Walloon Welsh Western Frisian Xhosa Yiddish Yoruba Zulu
Availability:
Freely Available
License:
Size:
55 million sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Biao Zhang | the open parallel corpus (OPUS) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
4000 entries Production Status:
Newly created-in progress
Use:
Parsing and Tagging
-
Paper title:A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation
-
Paper track:Long/Question Answering
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jan Deriu | OTTA | /N |
Documentation:
None




